Accessibility settings

Published on in Vol 13 (2026)

This is a member publication of University of Exeter (Jisc)

Preprints (earlier versions) of this paper are available at https://preprints.jmir.org/preprint/87128, first published .
Validity and Reliability of the Track-UL Algorithm Compared With Kinovea Software for Measuring Upper-Limb Functional Range of Motion in People After Stroke: Cross-Sectional Observational Study

Validity and Reliability of the Track-UL Algorithm Compared With Kinovea Software for Measuring Upper-Limb Functional Range of Motion in People After Stroke: Cross-Sectional Observational Study

Validity and Reliability of the Track-UL Algorithm Compared With Kinovea Software for Measuring Upper-Limb Functional Range of Motion in People After Stroke: Cross-Sectional Observational Study

1Medical School, Faculty of Health and Life Sciences, University of Exeter, St Luke's Campus, Exeter, United Kingdom

2Basic Science Department, Faculty of Physical Therapy, Cairo University, Giza, Egypt

3Faculty of Health and Life Sciences, NIHR Exeter Biomedical Research Centre, University of Exeter, Exeter, United Kingdom

4School of Public Health and Sport Science, Faculty of Health and Life Sciences, University of Exeter, Exeter, United Kingdom

Corresponding Author:

Hatem Lazem, MSc


Background: Approximately 70% of survivors of stroke have problems with arm function. Physiotherapists assess arm functional range of motion (ROM) using either a goniometer or functional questionnaires, which lack objective accuracy and require a skilled physiotherapist. We developed the Track-UL algorithm based on a markerless motion capture system to measure arm ROM.

Objective: This study aimed to measure the agreement between our novel Track-UL algorithm and Kinovea software in assessing arm ROM during functional tasks in the laboratory and home settings.

Methods: Videos were recorded while 27 survivors of chronic stroke performed 4 functional tasks (forward reaching, arm abduction, moving the hand toward the mouth, and moving the hand toward the head) in the laboratory and at home. The videos were analyzed by 2 independent raters using the Track-UL algorithm and Kinovea software. The limits of agreement and intraclass correlation coefficients were calculated.

Results: We found no clinically significant systematic bias in shoulder and elbow angle, with good agreement between the Track-UL algorithm and Kinovea software (assessed via Bland-Altman plots). The 95% limits of agreement were –3.18 to 6.41 degrees for the shoulder joint and −5.35 to 8.78 degrees for the elbow joint in the laboratory setting, and –6.21 to 3.62 degrees for the shoulder joint and −4.06 to 2.53 degrees for the elbow joint in the home setting. There was excellent absolute agreement between the measurement tools across all tasks and joints; intraclass correlation coefficient values ranged from 0.97 (95% CI 0.97-0.99) to 0.99 (95% CI 0.99-0.99; P<.001 for both laboratory and home measurements).

Conclusions: The novel Track-UL algorithm is an accurate, valid, and easy tool that can be used to assess upper-limb ROM in survivors of stroke at clinics and potentially at home. This will support physiotherapists in remotely monitoring and adapting rehabilitation programs.

JMIR Rehabil Assist Technol 2026;13:e87128

doi:10.2196/87128

Keywords



Stroke is the second leading cause of death and the third leading cause of long-term disability for approximately 70% of people who experience a stroke, affecting their quality of life [1]. Approximately 40% of people with stroke experience upper-limb problems at the chronic stage of recovery [2]. Following a stroke, people can experience muscle weakness [3], spasticity [4], and changes in the pattern of arm mobility on the affected side [5].

According to the National Institute for Health and Care Excellence guidelines, physiotherapists routinely design rehabilitation programs to improve upper-limb function and, therefore, need to efficiently and effectively assess and monitor improvements in the functional range of motion (ROM) in people with stroke [6]. In clinical settings, physiotherapists typically use a clinical goniometer [7] or validated clinical questionnaires such as the Fugl-Meyer Assessment for the upper extremity (FMA-UE) [8] to evaluate and monitor the upper-limb ROM in people with stroke.

A goniometer is the current standard tool for assessing ROM in physiotherapy practice [9]. However, this requires an adequate level of expertise from the practitioner to execute, so its precision can be influenced by clinical skills and experience [10,11]. In addition, the presence of compensation during upper-limb movements, such as shoulder and trunk rotation, and the disruption of elbow-shoulder coordination can affect the ease with which goniometers can be used to measure ROM [12]. A limitation of upper-limb clinical assessment questionnaires such as the FMA-UE scale is the time required to complete them, which can be up to 25 minutes. They also rely on a Likert-based rating system in which scores of 0, 1, and 2 indicate no movement, partial movement, and full movement, respectively, which is discrete and bounded, depends on the raters’ experience, and lacks the precision in quantifying progression and monitoring small changes in arm mobility over time [8].

An alternative approach is the use of affordable and free 2D markerless motion analysis tools such as the Kinovea software that allow therapists to measure the upper-limb kinematics from video footage [13]. However, such tools are better suited to research as the time and expertise of an experienced health care professional required for operation make them less accessible to physiotherapists in clinics. Computer vision algorithms based on markerless motion capture and pose estimation frameworks offer a potential solution for faster and easier measurement of the ROM without the need for an experienced health care professional [14]. So far, computer vision–based algorithms have been validated for people with ankylosing spondylitis [15] and following knee replacement [16] but are still in need of validation among people with stroke, who have different movement complexities such as compensatory movement patterns and reduced ROM, which differ from the normative datasets on which most algorithms are trained, which can introduce the possibility of tracking errors [12,17,18].

Therefore, based on the gap in the current literature on the validity and reliability of the pose estimation algorithms in stroke, the aim of this study was to assess the concurrent validity and test-retest reliability of a novel Track-UL algorithm based on MediaPipe, a pose estimation framework, as an objective and easy-to-use tool for measuring and monitoring upper-limb mobility in people with stroke in the laboratory and home settings.

The main objectives of this study were as follows:

  • To investigate the concurrent validity of the Track-UL algorithm based on a 2D pose estimation framework compared to the Kinovea software for measuring shoulder and elbow joint functional active ROM
  • To investigate the test-retest reliability of the Track-UL algorithm based on a 2D pose estimation framework for measuring shoulder and elbow joint functional active ROM
  • To explore the potential for using the Track-UL algorithm as a telemonitoring tool for measuring upper-limb functional active ROM from videos recorded by people with stroke in their own homes

Ethical Considerations

This study was conducted in accordance with the ethical standards of the responsible institutional and national committees on human experimentation and with the World Medical Association Declaration of Helsinki (1975, as revised in 2013). This project was reviewed and approved by an independent National Health Service research ethics committee and the Health Research Authority (reference 24/LO/0434). Written informed consent was obtained from all participants prior to their inclusion in the study. The privacy and confidentiality of participants’ data and identities were strictly maintained. Personally identifiable information was stored securely and separately from research data, with access limited to authorized members of the research team. Participants received compensation for travel expenses associated with study participation.

Participants

This was a cross-sectional observational study in which a total of 27 people with stroke participated. Data collection was conducted in the laboratory by the researcher (a physiotherapist) at the University of Exeter and participants’ homes. We recruited people with stroke aged 18 years or older who were more than 6 months after stroke with a score of 2 or lower on the Modified Ashworth Scale and any degree of upper-limb impairment resulting from stroke (FMA-UE score<57). Survivors of stroke were excluded if they had severe cognitive deficits and could not follow the instructions given by the researcher or had severe spasticity (score of >2 on the Modified Ashworth Scale). The reason for these exclusions was that these patient groups are not able to achieve active movements, which are a requirement of the tasks this study aimed to validate.

Procedure

All participants completed a single face-to-face laboratory testing session at the University of Exeter with a physiotherapist, which lasted up to 60 minutes, including collection of demographic data and assessment of upper-limb ROM, with videos recorded while participants performed 4 functional upper-limb tasks (Figure 1). Participants were then asked to repeat the same 4 tasks alone at home with or without support from their carers following written instructions provided by the research team while recording videos of the tasks within 3 days following the initial assessment.

Figure 1. The 4 functional complex upper-limb tasks.

To familiarize participants with the tasks, each movement was demonstrated to them, who then practiced it twice before starting. Participants were asked to wear light clothes with sleeveless shirts to aid identification of bony landmarks on the upper limb. Before recording the videos, colored circular stickers (markers) were applied to predefined bony landmarks of the upper limb (acromion process, anterior and midthoracic line, lateral epicondyle, midshaft of the humerus, midforearm, and midwrist). The use of these markers enabled measurement of shoulder and elbow range of movements during the 4 tasks (Figure 1). For the videos recorded at home by the participants or their carers, we used virtual markers implemented in the Kinovea software.

All videos were imported into a laptop and analyzed using both the Kinovea software and PyCharm Community Edition (version 2022.3.1; JetBrains) using the Track-UL algorithm, a function programmed in PyCharm. This function calculates the angle of joints of the human body based on the joint coordinates using the 2D vector dot product formula. All videos were recorded by 1 physiotherapist. Two physiotherapists (one of whom recorded the videos) who were blinded to each other’s measurements then independently analyzed the video recordings: one using the Kinovea software and the other using the Track-UL algorithm.

For home testing, we asked the participants themselves or their carers to record the videos only for the affected side within 3 days of the laboratory test and then send the videos back to the researcher.

Instruments

Kinovea is free and open-source analysis software used by health care professionals in research and clinical settings. It is considered to be a valid and reliable tool for measuring ROM using a digital goniometer, which is widely used in clinical and sports biomechanics for 2D kinematic analysis [19,20]. While the gold standard for measuring ROM is the 3D motion capture system, using Kinovea allows for a more applicable comparison aligned with the intended use of this algorithm in real-world environments such as a clinic or the home setting, in which stroke assessment and telerehabilitation usually occur. Analysis of the captured videos was carried out using Kinovea (version 0.9.5; Joan Charmant). Circular colored markers were placed on the bony landmarks of the upper limb prior to capture to facilitate calculation of upper-limb joint angles using a digital goniometer.

MediaPipe is an open-source pose estimation model developed by Google for machine learning applications [21] (Figure 2 [22]). The model infers 33 pose landmarks from the whole body; offers real-time processing and compatibility with standard video input from commonly available devices such as mobile phones; and offers a practical balance between accuracy and usability [23] compared to other pose estimation software such as OpenPose [24] and OpenCap [25] that require high computational resources and more complex setup, which can limit their applicability in a clinical setting or at home. The Track-UL algorithm was developed using PyCharm for automatic calculation of both shoulder and elbow joint angles from videos, which are then presented to the user. To calculate joint angles, coordinates for bony landmarks (hip, shoulder, elbow, and wrist) were extracted from video frames using the MediaPipe pose estimation framework. Each landmark provides 2D coordinates (x, y). Shoulder and elbow joint angles were computed by treating the connected rigid body segments (trunk, upper arm, and forearm) as 2D vectors (Figures 2 and 3 [22]).

Figure 2. BlazePose model of 33 human poses [22].
Figure 3. How the video appears to the physiotherapist when we apply the Track-UL algorithm.

A 2D vector dot product formula was used to calculate shoulder joint angle (term B in the following formulas) using coordinates of the shoulder joint and the adjacent hip (term A) and elbow (term C) joints:

θ=arccosA-BC-BA-BC-B
θ=arrco s(((xAxB)(xCxB)+(yAyB)(yCyB))/(((xAxB)2+(yAyB)2)×((xCxB)2+(yCyB)2)))

Video Processing

The researcher recorded 8 videos of 4 different upper-limb functional tasks performed on each side (affected and unaffected side) using a mobile phone camera (iPhone 13 Pro Max [Apple Inc] with a 12-MP main camera with sensor-shift stabilization, 1.9-µm pixels, and a 26-mm equivalent f/1.5-aperture lens) and a tripod. The camera was placed 1.5 m from the participants on a tripod at a height of 90 cm oriented with an angle of 90 degrees perpendicular to the chair. Participants were then asked to record the same videos at home following standardized instructions (Multimedia Appendix 1) within the following 3 days and only for the affected side to lower the risk of fatigue from repeated movements. Each task was repeated 3 times independently in each video with 10 seconds of rest in between each repetition and one 1-minute rest in between each task. During analysis, there were no apparent challenges regarding landmark detection by the Track-UL algorithm.

This study measured the functional active ROM for both the shoulder joint (flexion and abduction) and the elbow joint (flexion and extension) during 4 different functional complex tasks, which are routinely used in the clinic as part of the FMA-UE assessment. We measured both the affected and unaffected sides to understand the extent to which the Track-UL algorithm can estimate the functional active ROM accurately on the unaffected side as well as the affected side, which can include some substitute movements due to the motor impairments compared to the unaffected side.

Analysis

We used SPSS Statistics (version 29; IBM Corp) for data analysis. To assess the Track-UL algorithm’s concurrent validity against Kinovea, absolute agreement was evaluated using Bland-Altman analysis [26] to obtain the 95% limit of agreement (LoA) and mean bias metrics using the following equation: LoA = mean difference ± 1.96 × SD difference.

To measure the absolute agreement between our algorithm and Kinovea, we calculated the intraclass correlation coefficient (ICC) based on a 2-way random-effect model for a single measurement with an absolute agreement model (ICC(2,1)). This model accounts for both participant variability and systematic differences between the 2 assessment tools.

To assess the test-retest reliability for the Track-UL algorithm, we also calculated the ICC(2,1) for the repeated measures, the SE of measurement, and the minimum detectable change using the following equations [27,28]:

SEM=StDev1ICC
MDC=SEM2×1.96

ICC values between 0.81 and 1.0 are interpreted as very good or excellent, ICC values between 0.61 and 0.80 are interpreted as good, ICC values between 0.41 and 0.60 are interpreted as moderate, ICC values between 0.21 and 0.40 are interpreted as fair, and ICC values below 0.20 are interpreted as poor [29].


Overview

The demographic data of the participants are listed in Table 1.

Table 1. Participants’ demographic data (N=27).
CharacteristicsParticipants
Age (years), mean (SD; range)60.56 (13.82; 26-88)
Time since stroke (months), mean (SD; range)42.93 (33.07; 6-126)
FMA-UEa score, mean (SD; range)32.15 (16.21; 10-55)
MASb score—shoulder, mean (SD; range)1.22 (0.42; 1-2)
MAS score—elbow, mean (SD; range)1.44 (0.51; 1-2)
MAS score—hand, mean (SD; range)1.41 (0.50; 1-2)
Sex, n (%)
Male21 (77.8)
Female6 (22.2)
Type of stroke, n (%)
Ischemic19 (70.4)
Hemorrhagic8 (29.6)
Affected side, n (%)
Right16 (59.3)
Left11 (40.7)

aFMA-UE: Fugl-Meyer Assessment for the upper extremity.

bMAS: Modified Ashworth Scale.

Concurrent Validity for Laboratory Measurement

The Bland-Altman plots (Figures 4 and 5) reveal that the mean difference in shoulder and elbow joint angles between the Track-UL algorithm and Kinovea was close to 0, with a mean value ranging from −0.34 (SD 1.13) to 0.76 (SD 1.25) degrees for the unaffected side and −1.24 (SD 3.84) to 1.17 (SD 2.67) degrees for the affected side, indicating no systematic bias. The 95% LoA for joint measurements on the unaffected side ranged from −2.88 to 3.03 degrees for the shoulder and from −3.41 to 3.21 degrees for the elbow (Table 2). On the affected side, the 95% LoA ranged from −3.18 to 6.41 degrees for the shoulder and −5.35 to 6.29 degrees for the elbow (Table 2). This difference is not considered clinically significant [30]. There are a few outliers in some of the graphs; however, there is no clear evidence of a trend or heteroscedasticity in the plots except for the shoulder joint during shoulder abduction (task 1) on the affected side. In this case, a proportional bias was observed: as the abduction angle increased, the positive difference also increased. However, this positive difference of up to 6.41 degrees does not have a clinically significant impact on the measurement of the shoulder functional active ROM [30].

Figure 4. Bland-Altman plots for comparing the Track-UL algorithm and Kinovea software for the shoulder joint in both the unaffected and affected side in the laboratory setting.
Figure 5. Bland-Altman plots for comparing the Track-UL algorithm and Kinovea software for the elbow joint in both the unaffected and affected side in the laboratory setting.
Table 2. Mean values and limit of agreement (LoA) between the Track-UL algorithm and the Kinovea software based on the laboratory measurements.
Tested side and task numberElbow jointShoulder joint
Observations, nValues, mean (SD; 95% CI)Lower LoAUpper LoAObservations, nValues, mean (SD; 95% CI)Lower LoAUpper LoA
Unaffected side
1a — — —810.21 (1.45; –0.12 to 0.52)−2.643.03
281−0.16 (1.65; –0.52 to 0.21)−3.413.0880−0.09 (1.43; –0.40 to 0.23)−2.882.71
3810.76 (1.25; 0.48 to 1.03)−1.693.2181−0.02 (1.06; –0.26 to 0.21)−2.102.06
481−0.11 (1.05; –0.34 to 0.11)−2.191.9581−0.34 (1.13; –0.59 to –0.09)−2.561.87
Affected side
184 — — —841.17 (2.67; 0.59 to 1.75)−4.066.41
284−1.24 (3.84; –2.07 to –0.40)−8.786.2984−0.01 (2.58; –0.57 to 0.55)−5.075.05
384−0.07 (1.23; –0.34 to 0.18)−2.492.33840.02 (1.63; –0.34 to 0.37)−3.183.22
484−0.77 (2.33; –1.27 to –0.26)−5.353.8184−0.23 (1.56; –0.57 to 0.11)−3.312.84

aNot applicable.

Absolute Agreement Analysis for Laboratory Measurement

We used the ICC to assess the absolute agreement between the Track-UL algorithm and the Kinovea software when measuring ROM from videos recorded in the laboratory. The ICC(2,1) for the absolute agreement was calculated, showing excellent agreement across all tasks, joints, and sides (unaffected and affected). The ICC values ranged from 0.98 to 0.99 for the unaffected side and from 0.98 to 0.99 for the affected side (Table 3). This indicates that measurements were very consistent between the 2 assessment tools.

Table 3. Intraclass correlation between the Track-UL algorithm and the Kinovea software (laboratory setting).
Side and task numberJointIntraclass correlation coefficient (95% CI)F test (df)P value
Unaffected side
1Shoulder0.99 (0.99-0.99)714.557 (80, 80)<.001
2Shoulder0.98 (0.98-0.99)156.212 (80, 80)<.001
2Elbow0.98 (0.98-0.99)156.212 (80, 80)<.001
3Shoulder0.99 (0.99-0.99)879.210 (80, 80)<.001
3Elbow0.99 (0.99-0.99)1027.538 (80, 80)<.001
4Shoulder0.99 (0.99-0.99)473.902 (80, 80)<.001
4Elbow0.99 (0.99-0.99)481.433 (80, 80)<.001
Affected side
1Shoulder0.99 (0.99-0.99)455.606 (83, 83)<.001
2Shoulder0.99 (0.99-0.99)677.202 (83, 83)<.001
2Elbow0.98 (0.98-0.99)164.610 (83, 83)<.001
3Shoulder0.99 (0.99-0.99)1001.732 (83, 83)<.001
3Elbow0.99 (0.98-0.99)216.907 (83, 83)<.001
4Shoulder0.99 (0.98-0.99)260.500 (83, 83)<.001
4Elbow0.99 (0.99-0.99)959.581 (83, 83)<.001

Track-UL Algorithm Test-Retest Reliability for Laboratory Measurement

The Track-UL algorithm reliability results for both the shoulder and elbow joints on both sides are shown in Table 4. The ICC showed that the Track-UL algorithm had excellent reliability (all ICC values>0.81).

Table 4. Track-UL algorithm test-retest reliability for the laboratory measurements (N=27).
Tested side and task numberJointICCaSEMbMDCc
Affected side
2Elbow0.955.3214.77
3Elbow0.9111.7832.65
4Elbow0.957.4120.55
1Shoulder0.974.1111.41
2Shoulder0.984.7513.19
3Shoulder0.936.5618.19
4Shoulder0.884.2011.65
Unaffected side
2Elbow0.843.8310.62
3Elbow0.915.8216.15
4Elbow0.932.908.04
1Shoulder0.925.1914.39
2Shoulder0.897.2019.96
3Shoulder0.875.4315.07
4Shoulder0.971.875.20

aICC: intraclass correlation coefficient.

bSEM: SE of measurement.

cMDC: minimum detectable change.

Concurrent Validity for Home Measurement

Only 48.1% (13/27) of the participants were able to record videos at home that were fit for analysis. For the acceptable videos, the Bland-Altman plots (Figures 6 and 7) indicated that the mean difference in the shoulder and elbow joint angles between the Track-UL algorithm and Kinovea software was close to 0; the mean values ranged from −1.29 to 0.39 degrees, indicating no systematic bias. The 95% LoA ranged from −6.21 to 3.62 degrees for shoulder joint measurements and from −4.06 to 2.53 degrees for elbow joint measurements. This difference is not considered clinically significant [30] (Table 5).

Figure 6. Bland-Altman plots for comparing the Track-UL algorithm and Kinovea software for the shoulder joint in the affected side in the home setting.
Figure 7. Bland-Altman plots for comparing the Track-UL algorithm and Kinovea software for the elbow joint in the affected side in the home setting.

There was no clear evidence of a trend or heteroscedasticity on the plots except for the shoulder joint during shoulder elevation (task 2), which displayed a proportional bias whereby the positive difference increased at higher angles of elevation and the negative difference increased at lower angles. However, the positive difference of up to 3.45 degrees and the negative difference of up to −2.66 degrees had no substantial clinical influence on measuring shoulder functional active ROM [30] (Table 5).

Table 5. The limit of agreement (LoA) between the Track-UL algorithm and the Kinovea software from videos recorded at home.
Tested side and task numberElbow jointShoulder joint
Observations, nMean (SD; 95% CI)Lower LoAUpper LoAObservations, nMean (SD; 95% CI)Lower LoAUpper LoA
Affected side
1a39−1.29 (2.51; –2.11 to –0.47)−6.213.62
239−1.25 (1.39; –1.71 to –0.81)−3.991.47390.39 (1.56; –0.11 to 0.89)−2.663.45
339−.63 (1.61; –1.51 to –0.11)−3.792.5339−0.22 (1.69; –0.77 to 0.32)−3.543.09
439−1.01 (1.56; –1.51 to –0.49)−4.062.0539−0.27 (1.36; –0.71 to 0.17)−2.952.41

aNot applicable.

Absolute Agreement Analysis for Home Measurement

The absolute agreement between the 2 measurement tools (Track-UL algorithm and Kinovea software) for videos recorded at home was assessed using the ICC. The results demonstrated excellent reliability across all evaluated tasks and joints. ICC values ranged from 0.97 to 0.99, indicating a high degree of consistency between the 2 assessment tools (Table 6).

Table 6. Intraclass correlation between the Track-UL algorithm and the Kinovea software (home setting).
Side and task numberJointIntraclass correlation coefficient (95% CI)F test (df)P value
Affected side
1Shoulder0.99 (0.97-0.99)284.85 (38, 38)<.001
2Shoulder0.99 (0.99-0.99)2091.99 (38, 38)<.001
2Elbow0.99 (0.98-0.99)1772.51 (38, 38)<.001
3Shoulder0.97 (0.95-0.98)89.19 (38, 38)<.001
3Elbow0.99 (0.98-0.99)260.63 (38, 38)<.001
4Shoulder0.99 (0.99-0.99)467.71 (38, 38)<.001
4Elbow0.99 (0.99-1.00)2508.41 (38, 38)<.001

Track-UL Algorithm Test-Retest Reliability for Home Measurement

To assess the test-retest reliability of the Track-UL algorithm from the videos recorded at home, we collected 3 independent measures for each task. The ICC showed that the Track-UL algorithm had good to excellent reliability results for both the shoulder and elbow joints on the affected side (all ICC values≥0.778; Table 7).

Table 7. Track-UL algorithm test-retest reliability for the home measurement (n=13).
Tested side and task numberJointICCaSEMbMDCc
Affected side
2Elbow0.7713.0736.24
3Elbow0.985.9416.48
4Elbow0.949.1225.28
1Shoulder0.973.238.95
2Shoulder0.992.296.35
3Shoulder0.964.5212.54
4Shoulder0.914.3011.92

aICC: intraclass correlation coefficient.

bSEM: SE of measurement.

cMDC: minimum detectable change.


Principal Findings

The main aims of this study were to measure the concurrent validity, absolute agreement, and test-retest reliability of the newly developed Track-UL algorithm. This algorithm uses a 2D markerless motion capture system based on MediaPipe, a pose estimation framework, to measure shoulder and elbow functional active ROM from recorded videos in people with stroke. While most algorithms that use MediaPipe pose estimation models have been tested and implemented in healthy populations for a variety of applications [31-33], such as assessment for spinal diseases and frozen shoulder [34], among people with tremors [35], and for telerehabilitation purposes [36,37], they have not yet been validated in individuals with stroke using complex functional tasks. The use of computer vision–based assessment of functional movement in people with stroke has the potential to improve the accessibility and scalability of movement assessment by providing accurate, low-cost, and markerless measurement outside of specialist laboratory settings. Therefore, we assessed shoulder and elbow functional active ROM during 4 complex functional tasks in both laboratory and home settings to explore the efficacy and effectiveness of this approach for individuals with stroke. We also present the possibilities of using this technique for telemonitoring in stroke rehabilitation.

For the videos recorded by the researcher in the laboratory, we observed a positive agreement between the Track-UL algorithm and the Kinovea software in the measurement of upper-limb function in survivors of stroke in 4 different tasks. The LoA was below the minimum clinically significant difference, which supports the use of the Track-UL algorithm in clinical practice to evaluate and monitor the progression of upper-limb kinematics. The LoA was slightly higher in certain movements on the affected side relative to the unaffected side, potentially due to the compensatory shoulder joint movements (internal rotation during arm elevation) and elbow joint movements (flexion and extension), which influenced the process of analysis.

We observed a significant degree of absolute agreement between the Track-UL algorithm and the Kinovea software in videos captured by both the physiotherapist and the participants. This finding supports the interchangeability of tools as the Track-UL algorithm performs as an intuitive and convenient tool for assessing arm mobility in both clinical and remote settings. We support the integration of this tool into the clinical assessment of upper-limb kinematics in individuals with stroke. This tool offers significant advantages, particularly for those who experience difficulties commuting to clinics or who reside in rural areas, by enabling remote follow-up and monitoring of upper-limb recovery. Furthermore, it provides more convenient solutions for carers, reducing their time and financial burdens as a result of traveling to clinics with their relatives with stroke. We can integrate this tool with any telerehabilitation model to provide a telerehabilitation and telemonitoring tool for people to use at home.

The test-retest reliability of the Track-UL algorithm demonstrated excellent consistency and robustness in repeated clinical assessments both in laboratory and home environments. The findings indicate that the Track-UL algorithm can deliver highly reliable kinematic measurements for recorded videos of people with stroke, rendering it appropriate for both in-clinic and remote assessments in stroke rehabilitation. This can be clinically beneficial, particularly for individuals reporting fatigue, allowing us to request that they perform the task only once rather than multiple times.

We calculated the minimum detectable change of the Track-UL assessment tool and determined that it was slightly elevated for the elbow joint compared to the shoulder joint, more pronounced on the affected side. This is potentially attributable to personal variability and the complexity of the tasks, which emphasize functional movements rather than simple joint movements. Additionally, the use of a 2D markerless motion capture system limits measurements to a single plane, making the results more sensitive to compensatory movements such as whole-arm rotation. Therefore, if this algorithm is to be used for monitoring upper-limb kinematics over time as a telemonitoring tool, the data should be interpreted with caution in clinical practice.

Our findings align with those of Latreche et al [31], who reported good agreement and reliability of an algorithm using the MediaPipe pose estimation framework in comparison to a goniometer; however, their study focused on simple shoulder movements. In our study, we explored complex functional arm movements involving the shoulder and elbow joints in people with stroke, which were close to what we usually evaluate in clinical practice. Another study measured upper-limb kinematics using sensorized gloves equipped with electromagnetic sensors capable of assessing 6 df during functional reach and grasp tasks [38]; however, the Track-UL algorithm–based measurement tool has the potential to be more user-friendly, requiring no additional equipment, which increases its suitability for clinical use.

Many participants were either unable to accurately record videos at home (4/27, 14.8%), potentially because restricted home environments, such as small houses, made it difficult to achieve the required camera angle, or they were unable to record the videos at all (10/27, 37%) due to technical difficulties in following the video recording instructions and then sending the videos back to the research team, particularly among older survivors. Consequently, we could not analyze the videos to determine whether the setup was incorrect, whether the entire movement was captured within the camera’s frame, or whether the camera was positioned arbitrarily, all of which could affect pose estimation analysis. All of these factors affected the ability to record the videos at home and the ability of the algorithm to analyze the joint angles from the videos recorded independently by the participants or their carers at home. We recommend integrating this algorithm into an application that can offer a step-by-step guide for video recording alongside real-time feedback on the video quality, such as the camera angle, the distance from the camera, and frame coverage, to enhance the feasibility of using this algorithm as a telemonitoring tool for people with stroke.

For the current application, which aimed to validate 4 functional active ROM tasks, 2D analysis is appropriate. However, there are known limitations, such as lack of depth information and issues with out-of-plane movement detection. These could potentially be addressed using 3D with 2D recorded videos to yield further depth of information that could be useful in other contexts. We also recommend incorporating more functional tasks into the assessment and cross-validating them with functional questionnaires (FMA-UE). In addition, time-series analysis could yield extra layers of useful clinical information, including qualitative considerations such as movement smoothness, which could add further insight in future studies.

Conclusions

The Track-UL algorithm can be adopted as a cost-effective, fast, and user-friendly alternative to traditional motion analysis software such as Kinovea. This allows for its use in both clinical and potentially in remote settings. This technology has the potential to benefit people with stroke through monitoring and motor performance assessments. We recommend developing an application with an easy-to-use interface that enables survivors of stroke to securely record and analyze their arm movement both in clinics and at home. Automated reports can provide physiotherapists with objective, measurable kinematic data to help design and adjust personalized rehabilitation programs.

Acknowledgments

This study was supported by the Egyptian Ministry of Higher Education and Scientific Research as part of a funded PhD project. SEL is supported by the Dennis and Mireille Gillings Foundation. The research was carried out at the National Institute for Health and Care Research Exeter Biomedical Research Centre.

Funding

This study was funded by the Egyptian government and supported by the National Institute for Health and Care Research Biomedical Research Centre at the University of Exeter. HL has received funding for his PhD from the Egyptian Cultural and Educational Bureau (ID: MM48/21). HL, TCR, PL, HD, MM, and SEL are supported by the National Institute for Health and Care Research Exeter Biomedical Research Centre. HD is supported by National Institute for Health and Care Research HealthTech Research Centre.

Data Availability

The datasets generated or analyzed during this study are available from the corresponding author on reasonable request.

Authors' Contributions

HL, DH, AH, SEL, MM, and HD conceptualized the methods and designed the study. HL collected the data from the participants. HL, PL, XX, PN, and TCR analyzed and interpreted the data. HL and HD wrote the initial manuscript, and all authors reviewed, edited, and approved the final manuscript.

Conflicts of Interest

MM is a Co-Founder and Director of DigiTherapix. HD is codirector of DigiTherapix. DigiTherapix is not related to the subject matter of the manuscript, and no company resources or funds were used in this work. All other authors declare no other conflicts of interest.

Multimedia Appendix 1

Instructions for recording the videos.

PDF File, 358 KB

  1. Donkor ES. Stroke in the 21st century: a snapshot of the burden, epidemiology, and quality of life. Stroke Res Treat. 2018;2018:3238165. [CrossRef] [Medline]
  2. Raghavan P. Upper limb motor impairment after stroke. Phys Med Rehabil Clin N Am. Nov 2015;26(4):599-610. [CrossRef] [Medline]
  3. Colebatch JG, Gandevia SC. The distribution of muscular weakness in upper motor neuron lesions affecting the arm. Brain. Jun 1989;112(Pt 3):749-763. [CrossRef] [Medline]
  4. O’Dwyer NJ, Ada L, Neilson PD. Spasticity and muscle contracture following stroke. Brain. 1996;119(Pt 5):1737-1749. [CrossRef] [Medline]
  5. Levin MF, Selles RW, Verheul MH, Meijer OG. Deficits in the coordination of agonist and antagonist muscles in stroke patients: implications for normal motor control. Brain Res. Jan 24, 2000;853(2):352-369. [CrossRef] [Medline]
  6. Tang E, Moran N, Cadman M, et al. Stroke rehabilitation in adults: summary of updated NICE guidance. BMJ. Mar 22, 2024;384:q498. [CrossRef] [Medline]
  7. de Jong LD, Dijkstra PU, Stewart RE, Postema K. Repeated measurements of arm joint passive range of motion after stroke: interobserver reliability and sources of variation. Phys Ther. Aug 2012;92(8):1027-1035. [CrossRef] [Medline]
  8. Gladstone DJ, Danells CJ, Black SE. The Fugl-Meyer assessment of motor recovery after stroke: a critical review of its measurement properties. Neurorehabil Neural Repair. Sep 2002;16(3):232-240. [CrossRef] [Medline]
  9. Hanks J, Myers B. Validity, reliability, and efficiency of a standard goniometer, medical inclinometer, and builder’s inclinometer. Int J Sports Phys Ther. 2023;18(4):989-996. [CrossRef] [Medline]
  10. Lea RD, Gerhardt JJ. Range-of-motion measurements. J Bone Joint Surg Am. May 1995;77(5):784-798. [CrossRef] [Medline]
  11. Akizuki K, Yamaguchi K, Morita Y, Ohashi Y. The effect of proficiency level on measurement error of range of motion. J Phys Ther Sci. Sep 2016;28(9):2644-2651. [CrossRef] [Medline]
  12. Cirstea MC, Levin MF. Compensatory strategies for reaching in stroke. Brain. May 2000;123(Pt 5):940-953. [CrossRef] [Medline]
  13. Elrahim RMA, Embaby EA, Ali MF, Kamel RM. Inter-rater and intra-rater reliability of Kinovea software for measurement of shoulder range of motion. Bull Fac Phys Ther. Dec 2016;21(2):80-87. [CrossRef]
  14. Hellsten T, Karlsson J, Shamsuzzaman M, Pulkkis G. The potential of computer vision-based marker-less human motion analysis for rehabilitation. Rehabil Process Outcome. 2021;10:11795727211022330. [CrossRef] [Medline]
  15. Hannink E, Mansoubi M, Cronin N, et al. Validity and feasibility of remote measurement systems for functional movement and posture assessments in people with axial spondylarthritis. Healthc Technol Lett. Dec 2022;9(6):110-118. [CrossRef] [Medline]
  16. Mansoubi M, Leveridge P, Smith M, et al. Knee4Life: empowering knee recovery after total knee replacement through digital health protocol. Sensors (Basel). Nov 17, 2024;24(22):7334. [CrossRef] [Medline]
  17. Wade L, Needham L, McGuigan P, Bilzon J. Applications and limitations of current markerless motion capture methods for clinical gait biomechanics. PeerJ. 2022;10:e12995. [CrossRef] [Medline]
  18. Aprile I, Rabuffetti M, Padua L, Di Sipio E, Simbolotti C, Ferrarin M. Kinematic analysis of the upper limb motor strategies in stroke patients as a tool towards advanced neurorehabilitation strategies: a preliminary study. Biomed Res Int. 2014;2014:636123. [CrossRef] [Medline]
  19. Puig-Diví A, Escalona-Marfil C, Padullés-Riu JM, Busquets A, Padullés-Chando X, Marcos-Ruiz D. Validity and reliability of the Kinovea program in obtaining angles and distances using coordinates in 4 perspectives. PLoS One. 2019;14(6):e0216448. [CrossRef] [Medline]
  20. Fernández-González P, Koutsou A, Cuesta-Gómez A, Carratalá-Tejada M, Miangolarra-Page JC, Molina-Rueda F. Reliability of Kinovea® software and agreement with a three-dimensional motion system for gait analysis in healthy subjects. Sensors (Basel). Jun 2, 2020;20(11):3154. [CrossRef] [Medline]
  21. Lugaresi C, Tang J, Nash H, et al. MediaPipe: a framework for perceiving and processing reality. Presented at: Third Workshop on Computer Vision for AR/VR at IEEE Computer Vision and Pattern Recognition (CVPR) 2019; Jun 17, 2019. URL: https://research.google/pubs/mediapipe-a-framework-for-perceiving-and-processing-reality/ [Accessed 2026-04-21]
  22. Bazarevsky V, Grishchenko I, Raveendran K, Zhu T, Zhang F, Grundmann M. BlazePose: on-device real-time body pose tracking. arXiv. Preprint posted online on Jun 17, 2020. [CrossRef]
  23. Akturk S, Derdiyok FB, Serbest K. Markerless joint angle estimation using MediaPipe with a rapid setup for joint moment calculation. Multimed Tools Appl. 2026;85:38. [CrossRef]
  24. Cao Z, Hidalgo G, Simon T, Wei SE, Sheikh Y. OpenPose: realtime multi-person 2D pose estimation using part affinity fields. IEEE Trans Pattern Anal Mach Intell. Jan 2021;43(1):172-186. [CrossRef] [Medline]
  25. Uhlrich SD, Falisse A, Kidziński Ł, et al. OpenCap: human movement dynamics from smartphone videos. PLoS Comput Biol. Oct 2023;19(10):e1011462. [CrossRef] [Medline]
  26. Martin Bland J, Altman DG. Statistical methods for assessing agreement between two methods of clinical measurement. Lancet. Feb 1986;327(8476):307-310. [CrossRef]
  27. Stratford PW, Binkley JM, Riddle DL. Health status measures: strategies and analytic methods for assessing change scores. Phys Ther. Oct 1996;76(10):1109-1123. [CrossRef] [Medline]
  28. Stratford PW, Binkley J, Solomon P, Finch E, Gill C, Moreland J. Defining the minimum level of detectable change for the Roland-Morris Questionnaire. Phys Ther. Apr 1996;76(4):359-365. [CrossRef] [Medline]
  29. Koo TK, Li MY. A guideline of selecting and reporting intraclass correlation coefficients for reliability research. J Chiropr Med. Jun 2016;15(2):155-163. [CrossRef] [Medline]
  30. Mullaney MJ, McHugh MP, Johnson CP, Tyler TF. Reliability of shoulder range of motion comparing a goniometer to a digital level. Physiother Theory Pract. Jul 2010;26(5):327-333. [CrossRef] [Medline]
  31. Latreche A, Kelaiaia R, Chemori A, Kerboua A. Reliability and validity analysis of MediaPipe-based measurement system for some human rehabilitation motions. Measurement. Jun 2023;214:112826. [CrossRef]
  32. Garg S, Saxena A, Gupta R. Yoga pose classification: a CNN and MediaPipe inspired deep learning approach for real-world application. J Ambient Intell Human Comput. Dec 2023;14:16551-16562. [CrossRef]
  33. Hisham MN, Hassan MF, Ibrahim N, Zin ZM. Mono camera-based human skeletal tracking for squat exercise abnormality detection using double exponential smoothing. Int J Adv Comput Sci Appl. 2022;13(7). [CrossRef]
  34. Zhang W, Li Y, Cai S, et al. Combined MediaPipe and YOLOv5 range of motion assessment system for spinal diseases and frozen shoulder. Sci Rep. 2024;14:15879. [CrossRef]
  35. Friedrich MU, Roenn AJ, Palmisano C, et al. Validation and application of computer vision algorithms for video-based tremor analysis. NPJ Digit Med. Jun 21, 2024;7(1):165. [CrossRef] [Medline]
  36. Pornpipatsakul K, Chuengwutigool W, Chaichaowarat R, Foongchomcheay A. Bridging exercise monitoring system using RGB camera for stroke rehabilitation. Presented at: 2023 IEEE R10 Conference (TENCON); Oct 31 to Nov 3, 2023. [CrossRef]
  37. Kushnir A, Kachmar O, Bonnechère B. STASISM: a versatile serious gaming multi-sensor platform for personalized telerehabilitation and telemonitoring. Sensors (Basel). Jan 6, 2024;24(2):351. [CrossRef] [Medline]
  38. Khanna P, Oppenheim T, Tu-Chan A, Abrams G, Ganguly K. Measuring arm and hand joint kinematics to estimate impairment during a functional reach and grasp task after stroke. Neurorehabil Neural Repair. Jun 2023;37(6):409-417. [CrossRef] [Medline]


FMA-UE: Fugl-Meyer Assessment for the upper extremity
ICC: intraclass correlation coefficient
LoA: limit of agreement
ROM: range of motion


Edited by Sarah Munce; submitted 27.Nov.2025; peer-reviewed by Ahmed ELMelhat, Jongsang Son; final revised version received 19.Mar.2026; accepted 09.Apr.2026; published 11.May.2026.

Copyright

© Hatem Lazem, David Harris, Abi Hall, Thomas C Richards, Phaedra Leveridge, Maedeh Mansoubi, Xiaohan Xu, Paul Newell, Sarah E Lamb, Helen Dawes. Originally published in JMIR Rehabilitation and Assistive Technology (https://rehab.jmir.org), 11.May.2026.

This is an open-access article distributed under the terms of the Creative Commons Attribution License (https://creativecommons.org/licenses/by/4.0/), which permits unrestricted use, distribution, and reproduction in any medium, provided the original work, first published in JMIR Rehabilitation and Assistive Technology, is properly cited. The complete bibliographic information, a link to the original publication on https://rehab.jmir.org/, as well as this copyright and license information must be included.